# Japanese speech recognition

Japanese Hubert Base Phoneme Ctc
Apache-2.0
A Japanese phoneme recognition model fine-tuned with CTC from rinna/japanese-hubert-base, intended to improve the accuracy of Japanese speech recognition.
Speech Recognition Transformers Japanese
prj-beatrice
144
3
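The phoneme model above is trained with CTC. As a rough illustration of what CTC decoding involves, the following is a minimal sketch of greedy CTC decoding in plain Python: pick the best label per frame, merge consecutive repeats, then drop the blank symbol. The function name `ctc_greedy_decode`, the blank id 0, and the toy phoneme vocabulary and scores are assumptions made for illustration; they are not taken from the model's actual configuration.

```python
from typing import List, Optional, Sequence


def ctc_greedy_decode(frame_scores: Sequence[Sequence[float]],
                      vocab: Sequence[str],
                      blank_id: int = 0) -> List[str]:
    """Greedy CTC decoding: argmax per frame, merge repeats, drop blanks."""
    decoded: List[str] = []
    previous: Optional[int] = None
    for scores in frame_scores:
        # Index of the highest-scoring label in this frame.
        best = max(range(len(scores)), key=scores.__getitem__)
        # Emit a label only when it differs from the previous frame's label
        # and is not the blank symbol.
        if best != previous and best != blank_id:
            decoded.append(vocab[best])
        previous = best
    return decoded


# Hypothetical toy vocabulary; index 0 plays the role of the CTC blank.
TOY_VOCAB = ["<blank>", "k", "a", "o"]

# Six frames of made-up scores, one row per frame.
TOY_SCORES = [
    [0.10, 0.80, 0.05, 0.05],  # k
    [0.10, 0.70, 0.10, 0.10],  # k again (merged with the previous frame)
    [0.20, 0.10, 0.60, 0.10],  # a
    [0.90, 0.00, 0.05, 0.05],  # blank
    [0.10, 0.10, 0.70, 0.10],  # a (kept, because a blank separates it)
    [0.10, 0.10, 0.10, 0.70],  # o
]

if __name__ == "__main__":
    print(ctc_greedy_decode(TOY_SCORES, TOY_VOCAB))
```

The blank symbol is what lets CTC represent a genuinely doubled label: the two `a` frames survive as separate outputs only because a blank frame sits between them.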
Kotoba Whisper V2.2
Apache-2.0
A Japanese automatic speech recognition model based on Whisper, with integrated speaker diarization and punctuation restoration.
Speech Recognition Transformers Japanese
kotoba-tech
22.80k
47
Kotoba Whisper V2.0
Apache-2.0
Kotoba-Whisper is a distilled Japanese automatic speech recognition model developed by Asahi Ushio in collaboration with Kotoba Technologies. It is distilled from Whisper large-v3 and achieves a 6.3x inference speed improvement.
Speech Recognition Transformers Japanese
kotoba-tech
8,108
60
Japanese Wav2vec2 Base Rs35kh
Apache-2.0
A wav2vec 2.0 Base model fine-tuned on ReazonSpeech v2.0, a large-scale Japanese speech recognition corpus, for Japanese automatic speech recognition.
Speech Recognition Transformers Japanese
reazon-research
3,968
1
Parakeet Tdt Ctc 0.6b Ja
Parakeet TDT-CTC 0.6B is an automatic speech recognition (ASR) model capable of transcribing Japanese speech with punctuation, developed by the NVIDIA NeMo team.
Speech Recognition Japanese
nvidia
4,184
22
Kotoba Whisper V1.1
Apache-2.0
Kotoba-Whisper-v1.1 is a Japanese automatic speech recognition model based on Whisper, with added punctuation and timestamp processing capabilities.
Speech Recognition Transformers Japanese
kotoba-tech
476
33
Wav2vec2 Base Japanese Asr
Apache-2.0
A speech recognition model fine-tuned from rinna/japanese-wav2vec2-base on the Japanese subset of common_voice_11_0. It outputs hiragana only.
Speech Recognition Transformers Japanese
TKU410410103
68
3
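Because the model above outputs hiragana only, a reference transcript written in katakana would need to be converted before the two can be compared. The following is a minimal sketch of that normalization using the fixed Unicode offset between the katakana and hiragana blocks. The helper names `katakana_to_hiragana` and `is_hiragana_only` are hypothetical, and kanji are left untouched, since converting them requires a reading dictionary or morphological analyzer that this sketch does not cover.

```python
KATAKANA_START = 0x30A1  # small katakana "a"
KATAKANA_END = 0x30F6    # small katakana "ke"
KANA_OFFSET = 0x60       # distance between the katakana and hiragana blocks


def katakana_to_hiragana(text: str) -> str:
    """Map each katakana character to hiragana; leave everything else as-is."""
    out = []
    for ch in text:
        code = ord(ch)
        if KATAKANA_START <= code <= KATAKANA_END:
            out.append(chr(code - KANA_OFFSET))
        else:
            out.append(ch)
    return "".join(out)


def is_hiragana_only(text: str) -> bool:
    """True if every character is hiragana (U+3041..U+3096) or the long-vowel mark."""
    return all(0x3041 <= ord(ch) <= 0x3096 or ch == "ー" for ch in text)


if __name__ == "__main__":
    reference = "コンニチハ"
    normalized = katakana_to_hiragana(reference)
    print(normalized, is_hiragana_only(normalized))
```

A check like `is_hiragana_only` can be used to flag reference transcripts that still contain kanji or Latin characters after normalization.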
Kotoba Whisper V1.0
Apache-2.0
Kotoba-Whisper is a collection of distilled Whisper models for Japanese automatic speech recognition, jointly developed by Asahi Ushio and Kotoba Technologies. It is 6.3 times faster than the original large-v3 while maintaining a similarly low error rate.
Speech Recognition Transformers Japanese
kotoba-tech
2,397
53
Whisper Small Japanese
Apache-2.0
A Japanese speech recognition model fine-tuned from openai/whisper-small for Japanese speech-to-text.
Speech Recognition Transformers Japanese
Ivydata
356
5
Wav2vec2 Large Xlsr 53 Japanese
Apache-2.0
A Japanese speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53. It expects audio input sampled at 16 kHz.
Speech Recognition Transformers Japanese
Ivydata
19
4
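The model above takes audio sampled at 16 kHz, so recordings at other rates need to be resampled first. The following is a minimal sketch of that step using naive linear interpolation in plain Python. The function name `resample_linear` is hypothetical, and this approach applies no low-pass filter, so downsampling with it can introduce aliasing; in practice a dedicated routine such as `librosa.resample` or `torchaudio.functional.resample` would be the more usual choice.

```python
from typing import List, Sequence

TARGET_RATE = 16_000  # sample rate stated in the model description


def resample_linear(samples: Sequence[float], source_rate: int,
                    target_rate: int = TARGET_RATE) -> List[float]:
    """Resample a mono signal by linear interpolation (no anti-alias filter)."""
    if source_rate <= 0 or target_rate <= 0:
        raise ValueError("sample rates must be positive")
    if not samples or source_rate == target_rate:
        return list(samples)

    n_out = max(1, round(len(samples) * target_rate / source_rate))
    step = source_rate / target_rate  # input samples per output sample
    last = len(samples) - 1
    out: List[float] = []
    for i in range(n_out):
        pos = i * step
        left = min(int(pos), last)
        right = min(left + 1, last)
        frac = pos - left
        # Weighted average of the two neighbouring input samples.
        out.append(samples[left] * (1.0 - frac) + samples[right] * frac)
    return out


if __name__ == "__main__":
    one_second_48k = [0.0] * 48_000
    print(len(resample_linear(one_second_48k, 48_000)))
```

Keeping the target rate as a named constant makes it easy to check audio before passing it to a model that states its expected rate.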
Whisper Large V2 Mix Jp
Apache-2.0
An automatic speech recognition (ASR) model fine-tuned from OpenAI Whisper-large-v2 on Japanese speech datasets.
Speech Recognition Transformers
vumichien
93
9
Kan Bayashi Csj Asr Train Asr Transformer Raw Char Sp Valid.acc.ave
A Japanese automatic speech recognition (ASR) model with a Transformer architecture, trained on the CSJ dataset using the ESPnet framework.
Speech Recognition Japanese
espnet
13
0
W2v Hf Commonvoice From Xlsr53 Pretrain 0329UTC1500
A speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 on the Common Voice Japanese dataset.
Speech Recognition Transformers
qqpann
15
0
Wav2vec2 Large Xlsr Japanese Hiragana
Apache-2.0
A Japanese speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 that outputs hiragana.
Speech Recognition Transformers Japanese
vumichien
90
7
Wav2vec2 Large Xlsr Japanese 0325 1200
Apache-2.0
An automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-large-xlsr-53 for Japanese speech recognition.
Speech Recognition Transformers Japanese
qqpann
14
0